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ABSTRACT 


In this thesis, a hybrid tossless compression model is tested which employs a 
combination of both a lossy compression method and one or more tossless image 
compression methods to produce an overall lossless image compression. The hybrid 
model decomposes the original image into a browse and a residual image. The hybrid 
model is tested and evaluated using various combinations of lossy and lossless image 
compression methods. The lossy compression method used in the model is JPEG (Joint 
Photographic Experts Group). The lossless compression methods used are Huffman. 
Arithmetic, LZW, lossless JPEG, and Diagonal coding. The compression results 
achieved using the hybrid compression model are compared to the compression achieved 
using the corresponding direct lossless compression. Additionally, the hybrid model is 
evaluated as to the advantages that the decomposition of the image into browse and 


residual images provide to the user. 
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I INFRODUCTION 


A. REVIEW OF LIFERATURE 

Memory requirements to store, transmit, and display images have rapidly grown 
as the need for higher resolution images has increased. As a result of this explosion of 
data associated with images, various tmage compression algorithms have been 
developed. These compression algorithms capitalize on the redundancies tnherent in 
images to reduce the number of bits required to represent them. This results tn savings in 
the memory needed for image storage or in the channel capacity required for image 
Gemmsniaission [1], {2}, [3]. 

Image compression can be divided into two groups: lossy and lossless. Images 
may be compressed using a lossy or lossless compression method depending on the 
amount of compression and image resolution desired by the user. Lossy compression 
methods achieve high compression but produce an image which ts of lower resolution 
than the original image. Lossless compression methods achieve low compression but 
produce an exact replica of the original image. 

Some of the standard lossless compression methods are Huffman [1], [2], 
eaimimetic [1], [2], the Ziv and Lempel algorithms [2], [4], Predictive encoding [1], [2], 
Bit-plane encoding [1], [5], and Run-length encoding [6]. Each of these compression 
methods have many variations which are reported in the literature. A non-standard 


lossless compression method is Diagonal coding [7]. Lossy compression methods consist 


primarily of the Joint Photographic Experts Group (JPEG) algorithm [1], [2], [8], [9] and 
Fractal encoding [4], [10]. 

Comparisons of the performance of lossy and lossless compression methods 
reveal that some compression methods achieve better performance results in terms of 
compression ratios and root mean square error than others [11]. 

Lossy and lossless methods may be combined together to produce a lossless 
compressed image. Such an arrangement takes advantage of the high compression ratios 
achieved by the lossy methods and the error-free compression of the lossless methods 


[1], [LO]. 


BK. OVERVIEW OF THE THESIS 


The current chapter introduces the literature used itn the thesis and discusses the 
structure of the thesis. The various methods of image compression are presented. 

Chapter [I describes the goal of the thesis and discusses the proposed hybrid 
lossless compression model. The decomposition of an image into browse and residual 
images is introduced. The evaluation criteria used to evaluate the hybrid model and the 
lossless and lossy compression methods 1s defined. 

Chapter HI describes the lossy and lossless compression methods used to evaluate 
the hybrid lossless compression model. The lossy algorithm used in the evaluation ts the 


lossy JPEG. The lossless algorithms used in the evaluation are Huffman, Arithmetic, 


LZW, and lossless JPEG. 


Chapter IV discusses the results of using secondary compression to compress the 
lossy compressed image in the hybrid model. A comparative analysis of the compression 
achieved using direct lossless compression and the compression achieved using the 
hybrid lossless compression model is performed for each of the lossless compression 
methods. 

Chapter V introduces another lossless compression method called Diagonal 
coding. It is compared to the other lossless compression methods evaluated in the hybrid 
model. 

The optimization of the model 1s discussed in Chapter V1 with emphasis on the 
combination of lossy and lossless compression methods that result in high overall 
compression and a visually acceptable browse image. 

Chapter VII contains the general conclusions reached from the comparative 
analysis of Chapters IV, V, and VI. 

Appendix A contains the tabulated numeric data gathered during the research of 
the thesis. The source code for the Diagonal coding compression algorithm (encoding 


and decoding) is contained in Appendix B. 


ft. HYBRID LOSSLESS COMPRESSION MODEL 


In many practical situations involving images, a small degree of error in the pixel 
values can be tolerated without a significant effect on the displav. This suggests that 
there are advantages to a decomposition of images into a lossy component and an error 
component. A hybrid compression model which emplovs the browse and _ residual 
concept has recently appeared in the literature [7]. 

A hybrid image compression model was tested which utilizes both lossy and 
lossless image compression techniques to produce an overall lossless image compression. 
The model ts tested and evaluated using the JPEG algorithm, the industry standard for 
lossy image compression, and various popular lossless compression techniques. The total 
compression achieved using the model is compared to the compression achieved using 
standard lossless image compression techniques. Figure II.1 displays a block diagram of 
the hybrid lossless compression model. 

The model was evaluated using 8-bit (256 levels), 256x256 (65536 bytes) pixel 
grey-scale images in raw pixel grey map (rpgm) format. Three different images were 
used in the tests. The images were all structurely different from each other in order to 
test the model over a broad range of images. 

An image is first compressed using a lossy compression process. The lossy 
compression technique used in the testing of the model is the Joint Photographic Expert 
Group (JPEG) algorithm. After compression using JPEG, the compressed image is 


further compressed (secondary compression) using a lossless compression method. 


Several different lossless compression algorithms are tested and evaluated. The lossy 
compressed image 1s decompressed and compared on a pixcl by prxel basis to the original 
image. The decompressed image ts termed the browse image data as it can be used for 


browsing an image and preliminary analysis of the image. Browsing cnables a user to 
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Figure I1.1: Hybrid Lossless Compression Model 


determine whether a lossless representation of the original image ts required or rf the 
lossv browse tmage data 1s adequate for their needs. The difference between the original 
image and the decompressed tmage ts the error mage or residual rage. The residual 
image is compressed with a lossless compression routine. Once agatn, several different 
lossless compression algorithms are tested and evaluated. Lastly, the compressed browse 
image file and the compressed residual image file are appended together into a single 
file. Decoding consists of separating the appended file into the respective compressed 


browse and residual rmage files and applying the appropriate decompression algorithm to 


(a 


cach. Figure [1.2 displays a block diagram of the decoding process. Both the browse and 
residual image files are first decompressed using the same lossless compression routines 
Which were used to compress them. The resulting browse tmage file is then 
decompressed using the lossy JPEG algorithm. The residual image file is added on a 
pixel by pixel basis to the decompressed browse image file to obtain the original image. 
The hybrid lossless compression model combines the inherent advantages of both 
lossy and lossless compression algorithms to achieve the lossless result. The high 
compression achieved by the lossy JPEG algorithm combined with the error-free lossless 
algorithms results ina significantly compressed image, which upon decoding, ts an exact 


replica of the original image. 
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Figure II.2: Diagram of Decoding Process 


Various combinations of the lossy JPEG and lossless algorithms were evaluated 


in the model and compared. The evaluation criteria used in the comparisons was the total 


compression ratio (CR) achieved using a particular lossless compression algorithm in 
combination with the JPEG lossy algorithm. Compression ratio 1s the percent 


compression achieved as a result of compressing a file [2, p. 10]: 
CR = (1 - (Compressed Image Size / Original Image Size)) x 100. abies) 


A file whose file size does not change when compressed will have a compression ratio of 
Q) percent. A file which is compressed to one-third of its origmal size will have a 
compression ratio of 67 percent. Therefore. perfect compression occurs at 100 percent. 
A file whose compressed file size 1s greater than its orginal file size will have a negative 
compression ratio. The overall compression ratio achieved by the hvbrid lossless 
compression model is a combination of the compressed browse image CR and _ the 
compressed residual image CR. Application of Equation II.1 to browse, residual, and 


overall compression ratios lcads to: 


CIN 7 [CR 7 50] + [CR eeidual if 50] ITZ) 


overall browse 


where CR and CR, 


are the compression ratios of the compressed browse and 


browse esidual 


residual images. 

The overall compression ratios achieved using the model with different 
combinations of lossless techniques and JPEG are compared with each other and with the 
compression ratios achieved using standard lossless compression — techniques. 
Additionally, the benefits of breaking up the tmage into browse and residual images are 


evaluated and compared to the standard lossless compression methods. 


Hi. COMPRESSION TECHNIQUES 


A. LOSSLESS AND LOSSY TECHNIQUES 

Compression techniques can be divided into lossless and lossy methods. A 
lossless method always produces a decompressed image that 1s identical. pixel-for-pixel, 
to the original image. On the other hand, lossy methods produce a decompressed image 
that is not identical to the original image. The degree of difference between the lossy 
decompressed image and the original image depends upon the compression ratio desired. 
The higher the compression ratio, the greater the difference between the decompressed 
image and the original image. Lossless compression methods typically attain small 
compression ratios of about 50% or less while lossy methods can achieve much higher 


compression ratios. 


B. HUFFMAN CODING 

Huffman coding is a lossless compression method that assigns variable-length 
codes to symbols based on the probability of each symbol's occurrence in a file. It 1s 
based on the premise that if the probability of symbols in a file are known, and the 
probability distribution is a non-uniform distribution, variable-length codes can be 
assigned to each symbol which will result in compression of the file. When using this 
type of coding, a symbol that has a very high probability of occurrence generates a code 
with very few bits. A symbol with a low probability generates a code with a larger 
number of bits. Generating codes that vary in length according to the probability of the 


symbol they are encoding makes data compression possible. Each variable-length code 


can be uniquely decoded. Huffman coding achieves the minimum amount of redundancy 
possible in a fixed set of variable-length codes: however. this doesn't mean that [luffman 
coding ts an optimal coding method. It means that it provides the best approximation for 
coding symbols when using fixed-length codes [2, p. 18]. 

A binary tree is constructed from the individual symbols ina file. Each symbol is 
a child node in the tree. “A weight is assigned to cach child node. The assigned weight is 
either the frequency or the probability of the symbol occurring in the file. Therefore, 
symbols with a low probability of occurrence have lower weights assigned. The binary 
tree is built by combining the two lowest weight child nodes, creating a parent node. and 
assigning a weight to the parent node. The parent node's assigned weight 1s the sum of 
the two child node weights. A bit value of 1 is assigned to the path taken from the parent 
node to the child node with the lowest weight. The path from the parent node to the 
other child node ts assigned a bit value of 0. The process is repeated until only one node 
is left. This node is designated the root of the binary tree. The variable-length codes are 
generated by traversing the binary tree from the child node which represents the symbol 
of interest to the root. The bits tn the generated code are arranged in the order from root 
to child node. Table IJI.1 contains a list of five different symbols and their frequency of 
occurrence ina file. The table also contains the untque variable-length Huffman codes 
assigned to each symbol. Figure III.1 displays the Huffman binary tree for the file in 
Table III.1. Huffman codes have the untque prefix attribute, meaning that no code ts a 


prefix to another code. As a result, the codes can be correctly decoded despite berng 


variable length. Using Figure III.1, the Huffman code for 'MAZES' would be 


1101001110101 or 13 bits long. [feach letter in 'MAZES' requires eight bits to represent 
it, then a total of 40 bits would be required. In this case, Huffman coding produces a 


compression ratio of 67.5%. 





Table L.1: Huffman Coding Exampie 





Figure IfT.1: The Huffman Binary Tree. 


Huffman coding uses an integral number of bits for each code. which ts usually 
slightly less than optimal. Additionally, the compression program has to pass a complete 


copy of the Huffman coding statistics with the compressed data. This effectively reduces 


the amount of compression achieved. |luffman coding ts not an optimal coding method. 


but it is the best approximation that uses fixed codes with an integral number of bits. 


ce ARITHMETIC CODING 

Arithmetic coding 1s a lossless compression method that produces a single output 
code for an entire message. Unlike Huffman coding, tt does not produce a single code 
for each svmbol. Instead, arithmetic coding encodes a stream of input svmbols with a 
single floating-point output number in the range from 0 to 1. Each symbol added to the 
message incrementally modifies the output code. «As in Huffman coding, cach svmbol's 
probability of occurrence tn the file ts first determined. Next. cach svmbol ts assigned a 
range, corresponding to its probability of occurrence. in the interval from 0 to 1. Table 
I.2 contains a file with five different symbols, their probability of occurrence, and the 
range they occupy in the 0 to | interval. If the first symbol in the file 1s 'M', then the 
encoded floating-point output number will be a number between 0.60 and 0.70. [ach 
new symbol to be encoded further restricts the range of the output number. If the next 
symbol to be encoded ts 'A', then the encoded output number will be a number between 
0.60 and 0.62 since 'A' is assigned the range 0.00 to 0.20 in the 0.60 to 0.70 subrange 
established by the symbol 'M'. The higher the probability of a symbol, the less it will 
reduce the range and, therefore, add fewer bits to the code. The net effect of cach input 
symbol on the output code can be a fractional number of bits instead of an integral 
number since Arithmetic coding uses a fractional number of bits per code allowing 1t to 


incrementally improve compression performance. Table III.3 contains an example of the 


11 


Arithmetic encoding process resulting in the final low value, 0.61896, which will 
uniqucly encode the message 'MAZES'. The svmbol probabilities are taken from Table 
(11.2. The number of bits required to represent the number 0.61896 can be determined 


from: 


0.61896 = YA, (II.1) 
1 Z 


where A, ts the ith bit of the binary representation of 0.61896, 1 1s the index of the nth 
bit, and x is the minimum number of iterations necessary to repesent the number in 
binary. Selecting x to be a value of 20 ensures that the left side of Equation II.1 will 
have sufficient resolution in order to represent 0.61896. Therefore, 0.61896 can be 
represented in as few as 20 bits compared to the 40 bits required to represent the message 
'MAZES' using cight bits per character. This results in a CR of 50%. <A _ simple 
algorithm can be applied to Equation III.1 in order to produce the sequence A,, A,, ...Ayo- 
Simply multiply the left side of Equation III.1 by 2 repeatedly until an integer ts 
produced as a leading digit. Then subtract one and continue. For each | produced record 


a one, otherwise record a zero [12, p. 9]. 






Table H1.2: Arithmetic Coding Symbol Range 
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Table IH.3: Arithmetic c Encoding ae 


Decoding consists of determining which symbol falls within the range of the 
encoded message. In the example in Table III.3, the encoded message falls in the 
interval between 0.6 and 0.7. Therefore, the first character in the message must be 'M', 
The next character is decoded by subtracting the low value of 'M' from the encoded 
valuc, dividing by the width of the range of 'M' (0.1), and determining which character 


falls within the new interval. Table III.4 contains an example of the decoding process. 


Encoded j|Output Symbol High Range 
so 





Table IHI.4: Arithmetic Decoding Example. 


D. LIMPEL-ZIV (LZ) COMPRESSION 

LZ compression 1s a lossless compression method based on the work of Jacob Ziv 
and Abraham Lempel in 1977-1978 [2, p. 23]. It is a dictionary-based method using an 
adaptive dictionary to achieve compression. LZ compression is based on strings of 
symbols instead of individual symbols thereby exploiting the interdependency between 
svmbols in a string. «A table of strings 1s created from the input data and placed into a 
string dictionary. As each new string 1s input from the input data. the string dictionary ts 
searched for a string match. If a match 1s found, a code is output which represents the 
string in the dictionary. 

Ziv and Lempel's work resulted in two LZ compression methods, LZ77 and 
LZ78. LZ77 uses a sliding-window approach in constructing its dictionary. The 
dictionary consists of all the strings in a window of the input data stream. For example, 
ifa4K byte window ts used as the dictionary, the LZ77 algorithm looks for matches with 
strings found in the previous 4K bytes of data already read in. As new symbols of the 
input data stream are read in, the 4K byte window slides so that the last 4K bytes of input 
data is in the window, hence the term sliding-window. All string matches are encoded as 
pointers to the string in the dictionary. The amount of compression depends on how long 
the dictionary strings are and how large the sliding-window is. Figure III.2 shows a 
simple flowchart of the LZ77 compression process. 

LZ78 differs from LZ77 in the way that it builds and maintains its dictionary. 
Instead of having a limited-size window of the preceding input data, LZ78 builds its 


dictionary out of all of the previously input svmbols in the input data stream. The 
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dictionary of strings ts bunlt a single symbol at a time. The first symbol input from the 
input data stream 1s stored in the dictionary and becomes the current prefix. Each 
subsequent symbol from the input 1s added to the current prefix before a search for a 
match 1s made in the string dictionary. If a string match 1s found, a pointer code ts 
output which represents an offset into the string dictionary. If no match ts found. the 
stirng ts added to the dictionary. Once a string ts added to the dictronary, it ts available 
to the encoder at all (mes, not just for the next few thousand characters as in LZ77. This 
mercmental procedure works very well at tsolating frequently uscd strings and adding 
them to the dictionary. Consequently, strings in LZ78 can be very long, resulting in high 
compression ratios. 

Another variation of LZ compression is the LZW compression method. 
developed by Terry Welch in 1984 [2, p. 285]. LZW 1s an extension of LZ78. LZW 
differs from LZ78 in the way that it intially builds the dictionary. The dictionary 1s 
mitialized with single-symbol strings equal to the number of ASCII characters. In other 
words, the first 256 entrics in the dictionary are intialized with the byte valucs (0 to 255. 
Thus, there 1s no symbol that cannot be immediately cncoded cven if it has not already 
appeared in the input data stream. LZW uses a current prefix buffer and a current string 
buffer lke the LZ78 algorithm. The current string ts defined as the current prefix plus 
the next symbol input from the input data. A match 1s found for the first symbol. A 
code ts output, and the new string ts added to the string table. The current string ts added 


to the current prefix. This process continues until the input data stream ends. 
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Figure EHf.2: Flowchart of LZ77 Compression Process. 


ae RUN LENGTH ENCODING 

Run length encoding (RLE) ts arguably the least complex and easiest to 
implement lossless compression method. RLE capitalizes on the successive repetition of 
characters in a binary bit stream or image. It is effective only in applications involving 
many repeated characters. Instead of repeating each character, run length encoding uses 
a code which specifies how many consecutive characters are in the particular run. In the 
case of images, many consecutive grey-scale pixcls having the same value are an 
example in which run length encoding would produce some degree of compression. Run 
length encoding may actually expand a file if the average length of consecutive 


characters 1s less than the code used to specify them. Run length encoding can be 
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performed at the byte or the bit level depending on the application. It is used most often 


as a preprocessor for other compression algorithms [6, p. 37]. 


F. BIT PLANE ENCODING 

Bit plane encoding 1s the process of grouping single bits from the same position 
in a binary represcntation together to form a binary array. For cxample, an image 
containing N x N pixels, cach pixel represented by k bits, can be broken up into k 
different N x N bit planes. The most significant bit (MSB) of cach pixel binary 
representation is grouped together with the MSB of the remaining pixels to form a bit 
plane. Repeating this process for the other k-1 bits in each pixel results in k bit planes. 
Hence, the original image 1s now represented by k, N x N bit planes. The advantages of 
bit plane encoding are twofold. First, each individual bit plane can be encoded 
efficiently using a lossless compression routine. Secondly, bit plane encoding permits a 
technique called progressive transmission to be implemented. In progressive 
transmission, bit planes are transmitted in a sequence starting with the MSB bit plane and 
ending with the LSB bit plane. The transmitted bit planes are progressively 
reconstructed at the terminal end. The user may view an image as it 1s being 
reconstructed and elect to terminate the transmission or procced depending on the level 
of quality desired [1, p. 194]. 

The most significant bit planes tend to contain a lot of redundancy and are highly 


compressible. The least significant bit planes contain less redundancy and exhibit the 


uF 


behavior of random noise. As a result, the Icast significant bit planes are less 
compressible than the more significant bit planes [1. p. 54]. 

Bit planes may be combined together into subsets [5. p. 35]. Each subset may 
then be compressed with a lossless compression method. Grouping the bit planes into 
subsets may achieve higher compression ratios than performing lossless compression on 
each individual bit plane. The distributions of each bit plane or subset with respect to bit 
values of one and zero determines which optimum lossless compression technique to 


utilize for compression. 


G. PREDICTIVE ENCODING 


Predictive encoding may be either a lossless or a lossy compression method. The 
lossless predictive cncoding method is discussed here. Images are typically highly 
correlated from pixel to pixel, especially between adjacent pixels. This correlation 
between pixel values can be exploited to achieve compression of the image by using 
predictive encoding. Predictive encoding predicts the value of a given pixel based on the 
values of the pixels surrounding it. Numerous combinations of pixels exist. After 
predicting the value of the pixel, the predicted pixel value is subtracted from the actual 
pixel value to form an error value. This process 1s continued for all of the pixels in the 
image. The resulting error image will have a significantly different distribution or 
histogram than the original image. If the predictor accurately predicts the pixel values, 
the error will be small and the error image histogram will be narrow and Laplacian in 


nature [l, p. 62]. The error 1s encoded using a lossless compression method such as 


Is 


Huffman or Arithmetic. The better the predictor ts at predicting the pixel valucs, the 
smaller the resulting error. The smaller error can be encoded more efficiently, resulting 
in better overall compression of the image. The order of the predictor 1s determined by 
the number of surrounding pixcl values used to make the prediction. Generally, a higher 


order predictor will outperform a lower order one [1]. pp. 58-60]. 


fh JPEG 

JPEG 1s a compression standard created by the Joint Photographic Experts Group 
(JPEG). The JPEG compression standard has not vet been finalized but ts currently in 
the final stages of the standardizations process. The JPEG standard includes a 
specification for both lossy and lossless compression of images. The Discrete Cosine 
Transform (DCT) algorithm with quantization 1s used for lossy compression and a 
predictive method is used for lossless compression. The JPEG encoder consists of three 
stages: a transformation stage, a lossy quantization stage, and a lossless coding stage. 
The advantages of the DCT over the Discrete Fourier Transform (DFT) lic in the 
differences in their periodicities [1, pp. 108-111]. The DCT transformation stage 
converts the image to the frequency domain and concentrates the information energy into 
the first few transform coefficients, the quantization stage causes a controlled loss of 
information, and the lossless stage further compresses the image data. Figure [1.3 


displays a block diagram of the JPEG encoder. The DCT equation for an NxN pixel 


block 1s: 
i. ee an a oe | a + ve 
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Decoding consists of reversing the process and using the Inverse Discrete Cosine 
Transform (IDCT) in place of the DCT. The IDCT equation is: 
— Qm+ iin | | Gut lyr | 
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The image is a three dimensional signal (graphical image) where the x and v axes 
are the two dimensions of the screen, and the z axts is the amplitude or value of a pixel. 
This is the spatial representation of a signal. The two dimensional DCT 1s obtained by 
performing a one dimensional DCT on the columns followed by a one dimensional DCT 


on the rows [2, pp. 356-357]. 





8x8 Pixel DCT : Lossless Compressed 
Blocks Transformation Quantizer Encoder Image 





Figure III.3: Block Diagram of the JPEG Encoder. 
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The orginal image ts first partitioned into 8x8 pixct blocks. Each block ts 
independently transformed using the DCT. Each 8x8 pixel block has video cnergy 
distributed amongst its pixel clements. This video energy may be of low spatial 
frequency (slowly varying) or of high spatial frequency (quickly varying) [9, p. 5]. The 
DCT converts the spatial information into frequency or spectral information, with the x 
and y axes representing frequencies of the signal in two different dimensions. The 
transformed output of the 2-D DCT 1s an 8x8 array of 63 AC coefficients and 1 DC 
coefficient. The DC coefficrent 1s the mean value of the array and 1s located in the upper 
left corner. The AC coefficients are ordered such that the lower frequency coefficients 
are located near the DC coefficient with the higher frequency coefficients located awav 
from the DC coefficient. The DC coefficient always has the highest value of all the 
coefficients. Most images are composed of low frequency information. This suggests 
that the DC and lower frequency coefficients carry more uscful information about the 
image than the higher frequency coefficients. As a result, the ordering of the cocfficients 
in the array 1s significant. As we move farther away from the DC coefficient in the 
array, we find that the coefficients have lower values and become far less important for 
describing the image [2, p. 359]. An example of the effects of DCT processing on an 
8x8 pixel block 1s shown in Figure [II.4. 

The quantization stage of the JPEG encoder quantizes the coefficients of the DCT 
transform array to reduce their magnitude and to increase the number of zero value 
coefficients. Quantization is the lossy stage in the JPEG encoder. The degree of 


quantization ts controlled by a variable called the quality factor. The quality factor 1s a 


number which changes the default quantization matrix by an effective multiplicative 
factor of the quality factor. Each of the DCT coefficients 1s divided by the corresponding 
quantizing value in the quantization matrix and rounded to the nearest integer. The 
greater the number of high frequency (lower information content) AC coefficients 
converted into zeros, the greater the compression achieved by the subsequent lossless 
cncoding stage. Consequently, a higher quality factor results in better compression while 
a lower quality factor results in a better quality image upon decompression. A sample 
quantization matrix is shown in Figure UI.5. Figure IIl.6 displays a sample DCT 
transformed image before and after quantization. 

Prior to the final lossless encoding stage, the quantized DCT coefficients are 
arranged in a zig-zag pattern (see Figure III.7) with the lowest frequencies first and the 
highest frequencies last. The numbers 1-64 in Figure IJI.7 represent the sequence that 
the pixels are placed in the output sequential bit stream. This type of pattern is used to 
increase the number of consecutive zero coefficients in the 8x8 block. This allows for 
further compression using a lossless method such as run length encoding, Huffman or 
Arithmetic [8, p. xxi]. 

The lossless encoder encodes the 8x8 pixel block DC coefficients using 
Differential Pulse Code Modulation (DPCM). DPCM encodes the difference between 
the quantized DC coefficient of the current block and the quantized DC coefficient of the 
previous block. The AC coefficients are coded using a combination of run length 


encoding and Huffman. 
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Figure [1I.4: Sample Image Data before and after DCT 
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Processing [2, p. 363]. 
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Figure HI.5: Sample Quantization Matrix [ 


.p. 367] 


The JPEG compression standard also contains a_ lossless compression 
specification based on predictive encoding and Huffman. This lossless mode of 
operation 1s wholly independent of the DCT processing previously discussed. The 
lossless JPEG predictive encoder has seven different predictors to choose from. The 
seven different predictor models combine the values of up to three neighboring pixels (A, 
B, and C) to predict the current pixel value (X) in Figure {II.8. This prediction is then 
subtracted from the actual pixel value, and the difference 1s encoded losslessly using 
Huffman. Any one of the seven predictors (K =:1 - 7) listed in Table III.5 can be used. 
The K = 1, 2, and 3 predictors are one dimensional predictors while the K = 4, 5, 6, and 7 


predictors are two dimensional predictors. 
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DCT Matrix before Quantization: 


DCT Matrix after Quantization: 
90) 0) 
-35 -56 
-84 54 
-45 -33 
-77 -39 


-15 
-5] 





Figure I1I.6: DCT Transformed Image before and after 
Quantization [2, p. 368]. 
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Table HI.5: Lossless JPEG Predictors. 


IV COMPARISON OF COMPRESSION METHODS 


A. OVERVIEW 

A comparison of the compression ratios achieved by direct compression of three 
test images using the standard lossless compression methods ts performed. .\ comparison 
of compression ratios 1s also performed when the standard lossless compression methods 
are tested in the hybrid lossless compression model. Additionally, the hybrid model 
compression results are compared to the direct compressions achieved by the standard 


lossless methods. 


B. TEST IMAGES 

Three different 8-bit, 256x256 (65536 bytes) pixel grey-scale images tn raw pixcl 
grey map format were used to evaluate the hybrid lossless compression model. The three 
test images are displaved in Figure [V.1. The first image, LENA, 1s an image whose 
pixel values range over most of the 256 possible grey-scale levels. The tmage contains 
sharp contrasts and edges. The second image, SHUTTLE, has a range of pixel values 
that is less than that of LENA. A small range of pixel values dominate the image. The 
image contains large areas where the pixel values do not change significantly, such as the 
plume from the rocket motors and the sky background. The third tmage, 
FINGERPRINT, is dominated by a more narrow range of pixel values. The image 
contains large areas of whitespace. Pixel values that are contained in an image and their 
frequency of occurence are plotted in a histogram. Histograms of each of the threc test 


images is displayed in Figure [V.2. As expected. LENA contains a wide range of pixcl 
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Figure IV.1: Three Test Images (a) LENA, (b) SHUTTLE, 
(c) FINGERPRINT. 
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Figure IV.2: Histograms of the Three Test Images (a) LENA, (b) SHUTTLE, 
(c) FINGERPRINT. 


values; SHUTTLE 1s dominated by a smaller range of pixel values: and FINGERPRINT 


contains a very narrow range of dominant pixel values. 


C. LOSSY JPEG 

The lossy JPEG algorithm used in the model was developed by Andy C. Hung at 
the Portable Video Research Group (PVRG), Stanford University (9]. The quality factor 
used when compressing an tmage determines the amount of compression achieved and 
the resolution of the image when it is decompressed. The higher the quality factor. the 
greater the compression and the less the resolution upon decompression. Figure [V.3 
graphically displays the quality factor versus compression ratio achieved for the three test 
images. The graph data ts tabulated in Table A.] in Appendix A. The measure of the 
resolution of the decompressed image as compared to the original image 1s termed the 


root mean square error (c,,,.) and 1s a measure of the error between the two images [3, pp. 


2o- 297 |: 
) | No} AGI : 0.5 
Cine = 47] Qa Ly [g(x y) - f(x.y)| (IV.1) 
N| x=0 5=0 


where, for NxN pixel images, f(x,y) 1s the array of pixel values for the original image 
while g(x,y) 1s the array of pixel values for the decompressed image. Figure IV.4 


graphically displays a plot of quality factor versus e_. for each of the three test images. 


rms 


The graph data is tabulated in Table A.2 in Appendix A. As the quality factor is 


increased, the ec, of the decompressed image decreases as expected. The decompressed 
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Figure [V.3: Comparison of Quality Factor vs CR for the Three Test 
lmages. 
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Figure IV.4: Comparison of Quality Factor vs e_,,, for the Three 
Test lLnuages. 
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test image LENA 1s displayed in Figure [V.5 after compression at various quality factors. 
Note that as the quality factor increases, the resolution of the decompressed tmage 
decreases. At quality factors greater than 100, the decompressed image begins to exhibit 


distinct blockiness due to the processing of 8x8 pixel blocks by the JPEG algorithm. 


D. SECONDARY COMPRESSION 

The hybrid lossless compression model was first evaluated by assessing if it ts 
feasible, in terms of compression overhead, to use secondary compression to achieve a 
lossless process. In order for secondary compression to be feasible. 11 would have to 
contribute some measureable increase in the compression achieved alter compressing an 
image using lossy JPEG. The lossless compression methods used for secondary 
compression are Huffman, Arithmetic, and LZW and the code 1s taken from Nelson [2]. 
Table IV.1 contains the results of secondary compression on the three test tmages first 
compressed using lossy JPEG at different quality factors. The results are expressed as 
the percent compression ratio (CR) achieved. The results show that secondary 
compression does not significantly increase the compression of the three test images 
used. In fact, in all but a few cases, secondary compression of the lossy JPEG 
compressed image resulted in an expansion (1.e., negative CR) of the compressed image 
file size instead of compression. Since secondary compression does not provide a 
significant reduction in the compressed image file size, the hybrid lossless compression 
model was modified accordingly. The modified hybrid lossless compression model 1s 


displayed in Figure IV.6. 
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Figure IV.5: Decompressed LENA at Various Quality Factors (a)Original 
Image, (b) Q=100, (c) Q=250, (d) Q=350, (e) Q=500, 
(f) Q=800. 
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Table IV.1: aa a CR Results for the Three Test Images. 
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Figure IV.6: Modified Hybrid Lossless Compression Model. 


E. COMPARISON OF LOSSLESS COMPRESSION METHODS 

The three test images were first compressed using standard lossless compression 
methods in order to provide a reference to compare the compression results achieved by 
the hybrid lossless compression model. Huffman, Arithmetic, LZW, and lossless JPEG 
were the lossless compression methods used. Once again, the Huffman, Arithmetic, and 
LZW algorithms are taken from Nelson [2]. The lossless JPEG algorithm is taken from 
Andy C. Hung's PVRC-JPEG algorithm [9]. The direct lossless compression results 
achieved are graphically displayed in Figure ITV.7. All seven of the lossless JPEG 


predictor algorithms were used in the compression test, but only the predictor algorithm 
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Which achieved the best results is graphed in Figure [V.7. For all three test images. 
lossless JPEG achieved greater compression than the other three lossless compression 
methods used. Nonetheless, the lossless JPEG does not provide the convenience of a 
browse and residual decomposition. 

The hybrid lossless compression model (Figure 1V.6) was first evaluated using 
standard lossless compression techniques. Another lossless method. Diagonal coding, 
recently reported in the literature. will be discussed in the context of the hybrid model in 
Chapter V. Huffman, Arithmetic, LZW, and lossless JPEG were used to compress the 
residual image ((B) shown in Figure 1V.6). The results achieved after compressing the 
three test images using the hybrid lossless compression model with Iluffman, Arithmetic. 
LZW, and the lossless JPEG methods are graphically displayed in Figures IV.8, IV.9, 
and 1V.10 respectively. The test images were compressed at various quality factors. The 
lossless JPEG predictor algorithm that achieved the greatest compression of each residual 
image 1s graphed. The second and third order predictor algorithms (K=4, 5, 6, 7) 
predominantly achieved the greatest CR on the residual images and are 4Hidentified 
Table A.3 in Appendix A for each image. 

A comparison between the compression results achieved by the direct lossless 
compression methods and the hybrid lossless compression model 1s graphically displaved 
in Figures 1V.11, 1V.12. and IV.13 for cach of the three test images at various quality 
factors. For ease of reading, it should be noted that the right-most 3-D bar in cach 
column represents the compression achieved compressing the image with that particular 


direct lossless compression method (not using the hybrid lossless compression model). 
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F. CONCLUSIONS 3 

The lossless JPEG algorithm achieves the greatest compression on cach of the 
three test images when compared to the other three direct lossless compression methods. 
The lossless JPEG predictor algorithm which achieved the greatest compression was 
K=2, K=6, and K=5 for the test images LENA, SHUTTLE, and FINGERPRINT, 
respectively, and achieved compression ratios of 34%, 49%, and 27% (see Figure [V.7). 
The highest compression ratio achieved by the other three direct lossless compression 
methods for cach of the three images was 8%, 18%, and 13% (sce Figure IV.7). 

The hybrid lossless compression model achieved its best compression results on 
the test 1mage LENA when the arithmetic method was used to compress the residual 
image. The best overall compression was achieved using a quality factor of 100 to 
compress the original image with lossy JPEG. A hybrid compression ratio of 31% was 
achieved compared to direct Huffman (7%), Arithmetic (7%), LZW (-3%), and lossless 
JPEG (34%) (see Figure IV.8). 

The hybrid model achieved its greatest compression on SHUTTLE when using a 
quality factor of 50 to compress the original image using lossy JPEG and the arithmetic 
method to compress the residual image. A hybrid compression ratio of 48% was 
achieved using this combination compared to direct Huffman (16%), Arithmetic (16%), 
LZW (18%), and lossless JPEG (49%) (see Figure IV.9). 

The greatest compression was achieved on FINGERPRINT when using a quality 


factor of 50 in combination with the arithmetic method. A hybrid compression ratio of 
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31% was achieved compared to direct Huffman (13%), Arithmetic (13%), LZW (13%). 
and lossless JPEG (27%) (cm Figure 1V.10). 

In all cases, the hybrid lossless compresston model achieved greater compression 
ratios on all three test mages than did the direct lossless compression methods with the 
exception of the direct lossless JPEG method. Due to the wide diversity of images 
compressed using the hybrid model, these results suggest that the hybrid model will 
achieve similar favorable compression results on any grey-scale image. The hvbrid 
model achieved a Iesser CR on LENA and SHUTTLE than did direct lossless JPEG: 
however, the model did achieve a greater CR than direct lossless JPEG on 
FINGERPRINT at quality factors of 50 and 100 (sce Figure [V.10). The hybrid modcl 
enjoys the advantage of producing a compressed browse image which ts significantly 
more compressed than the direct lossless JPEG compressed image. For instance, using a 
quality factor of 100 to compress LENA produces a lossy compressed browse tmage with 
a file size of 4823 bytes (compression ratio of 92%). The best lossless JPEG predictor 
algorithm produccs a direct lossless compressed file size of 43322 bytes (compression 
ratio of 34%). Decompressing the lossy compressed LENA browse image produces an 
image that ts visually lossless with no visual distortions (sce Figure IV.5 (b)). If a 
lossless image is desired then the residual tmage of 40353 bytes can be transmitted and 
added to the browse image to produce an cxact replica of the original image. 

In the next chapter, a recently discovered lossless method known as Diagonal 
coding [7] is discussed and tested. Comparison to the results of this chapter will be 


made. 
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Figure [V.7: Comparison of Direct Lossless Compression on the 
Three Test Images. 
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Figure 1V.8: CR Achieved Using the Hybrid Lossless Compression 
Model on LENA at Various Quality Factors. 
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Figure IV.9: CR Achieved Using the Hybrid Lossless Compression 
Model on SHUTTLE at Various Quality Factors. 
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Figure [V.10: CR Achieved Using the Hybrid Lossless Compression 
Model on FINGERPRINT at Various Quality Factors. 
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Figure [V.11: Comparison of Hybrid Lossless Compression Model 
with Standard Lossless Compression Methods tor 
LENA at Various Quality Factors. 
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Figure [V.12: Comparison of Hybrid Lossless Compression Model 
with Standard Lossless Compression Methods for 
SHUTTLE at Various Quality Factors. 
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Figure 1V.13: Comparison of Hybrid Lossless Compression Model 
with Standard Lossless Compression Methods for 
FINGERPRINT at Various Quality Factors. 
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V. DIAGONAL CODING 


ie INTRODUCTION 

Another lossless compression method 1s Diagonal coding. Although not a 
standard compression method, it is nonetheless a simple, easy to implement compression 
method which achieves good compression ratios when used to compress residual images 
resulting from the compression of the original image at low quality factors. The 
compression ratios achieved by Diagonal coding are compared to those attained by the 


other standard lossless compression methods. 


B. RESIDUAL IMAGE HISTOGRAM 

The residual image resulting from the pixel by pixel differences in the original 
image and the decompressed image exhibits a Laplacian distribution with a mean of zero. 
The residual image distribution, or histogram, has a reduced variance compared to the 
original image and is also significantly less correlated [1, p. 60]. The shape of the 
residual image histogram is dependent upon the quality factor used to compress the 
original image using lossy JPEG. As previously discussed in Chapter IV, the higher the 
quality factor used, the more compression achieved; however, the decompressed image 
will less resemble the original image. This results in a residual image containing a wider 
range of pixel values. As a result, the residual image histogram will exhibit a wider 
Laplacian distribution. Lossless compression routines which are designed to take 
advantage of this type of image distribution will achieve significant compression results. 


Figure V.1 displavs residual image histograms of LENA for various quality factors. Note 
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Figure V.1: Residual Image Histograms of LENA (a) Q=5. (b) Q=50, (¢) Q=500. 


that as the quality factor used to compress the original image of LENA ts increased, the 


distribution of the corresponding residual image widens. 


Cy DIAGONAL CODING 

Due to the residual image exhibiting a Laplacian distribution with a smaller 
variance of pixel values than the original image, a lossless compression method that 
employs variable length encoding should achieve significant compression of the data [7, 
pp. 9-10]. Diagonal coding 1s a type of variable length encoding designed to take 
advantage of the Laplacian distribution of the residual image. [n Diagonal coding, each 
pixel value is represented by the number of zeros corresponding to that value, terminated 
by a onc. Since higher pixel values in the residual image data occur less frequently than 
lower pixel values, the coding 1s optimal [7, p. 10]. As with other lossless compression 
methods, there are variations to Diagonal coding. One variation is to group residual 
image data values together into sets and assign a diagonal code to cach set. For example, 
a sect may consist of the four values -1, 0, 1, and 2. This set may be called set 0 and 
assigned the diagonal code of 1. An example of Diagonal coding using sets 1s displayed 
in Table V.1. During encoding, the diagonal code representing cach set 1s followed by 
two bits used to identify which value in the set 1s being encoded. For example, the 
combination of two bits of 00, O1, 10, and 11 is used to identify the residual image data 
values of -1, 0, 1, or 2 in set 0. Using Table V.1 as a reference, encoding the residual 
image data values of -1, 3, -4, and 8 would result in the code of 100011000101000111. 


Note that the length of a bit sequence associated with a particular residual data value (one 


44 


byte) will depend on tts location in Table V.1. The C high-level language reads and 
writes bytes at a time. For efficient compacting of the coded bit stream. a special C 
source code program was written that operates at the bit level. Operating at the bvte 
level would destroy any advantages of this coding method. The source code for the 
Diagonal coding (encoding and decoding) used in the thesis was written by the author 
and 1s enclosed as Appendix B. A flowchart of the source code for encoding and 


decoding ts shown in Figures V.2 and V.3 respectively. 

















—— 
Se 


Table V.1: Diagonal Coding Example. 


so) 





Diagonal coding was first used in a direct compression role to compress the three 
(est images. A comparison of diagonal coding with the other four lossless compression 
methods was performed. The results are graphically displaved in Figure V.4. The graph 


data is tabulated in Table A.4 in Appendix A. Figure V.4 1s the same as Figure ITV.7 in 


45 


Read in Residual 
Image File 


Get Pixel Value 


Determine what 
Set Pixel Value 


is In 


Output Diagonal 
Code 


Output 2-Bit ID 
Code 





Figure V.2: Flowchart for Diagonal Encoding. 
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Figure V.3: Flowchart for Diagonal Decoding. 
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Chapter 1V with the addition of the Diagonal coding results. Diagonal coding produced 
an expansion in the image file size after compression for all three of the test tmages. It ts 
clearly not a viable lossless compression method for images which do not exhibit a 
narrow Laplacian distribution (histogram) with a mean of zcro. 

Next, Diagonal coding was used tn the hybrid lossless compression model to 
compress the residual image. Each of the three test tmages were used and were 
compressed at various quality factors. A comparison of Diagonal coding wrth the other 
four lossless compression methods was performed. The results are graphically dtsplaved 


in Figures V.5, V.6, and V.7. It is observed that at low quality factors (1.e., low e._) the 


rns 


standard entropy based methods, Huffman and Arithmetic, are very competitive tn the 


hybrid model. At high quality factors (1.c., high ¢,.), the lossless JPEG tends to be the 


rns 
most competitive. It 1s noted that Diagonal coding ts very inefficient at a quality factor 
of 500. The graph data ts tabulated in Tables A.5, A.6, and A.7 in Appendix A. These 
three figures are the same as Figures IV.8, IV.9, and IV.10 in Chapter [TV with the 
addition of the Diagonal coding results. 

A comparison between the compression results achieved using direct lossless 
compression and the hybrid lossless compression model using Huffman, Arithmetic, 
LZW, lossless JPEG, and Diagonal coding to compress the residual image ts graphically 
displayed in Figures V.8, V.9, and V.10. It ts observed that, with the exception of 


lossless JPEG, generally one or more of the hybrid compression schemes will achieve a 


higher CR than its direct counterpart (see the fifth column for each method in Figures 
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V.8, V.9, and V.10). In the lossless JPEG case, the hybrid methods are fairly 
competitive to the direct lossless JPEG and even demonstrate a slight marginal CR 
advantage in the case of FINGERPRINT (see Figure V.7). In most cases. LZW 1s not 
competitve with the other lossless compression methods. The graph data is tabulated in 
Table A.8, A.9, and A.10 in Appendix A. Once again, these three figures are the same as 
Figures [V.11, 1V.12, and [1V.13 in Chapter [V with the addition of the Diagonal coding 
results. 

Other variations of Diagonal coding were tested and evaluated in an attempt to 
achieve higher compression results when compressing the residual images. One variation 
consisted of altering the number of range valucs in each set and performing Run-length 
encoding on the longer diagonal codes. This variation achieved minimal compression 
improvements and in most instances resulted in less compression than did the baseline 
Diagonal coding method. Another variation consisted of breaking up the residual image 
data into bit planes and performing Diagonal coding on them. For example. the six most 
significant bit (MSB) bit planes were combined together, and the two least significant bit 
(LSB) bit planes were combined together to form two separate data sets. Diagonal 
coding was performed on each data set and the resulting compressed files were added 
together to form an 8-bit compressed file. Different combinations of bit-planes were 
tested and evaluated; however, none achieved the compression results attained by 
performing Diagonal coding on the original 8-bit residual image file. The two LSB's tn 
the residual image are primarily noise and contribute little to the quality of the original 


image. If they are removed from the original image, no significant visual degradation 
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occurs to the image. Indeed, high compression ratios were achieved using Diagonal 
coding to compress the six MSB's: however. the process 1s not a truly lossless one and 
was therefore not included in the compression ratio comparisons. .\ representative 
sample of data produced from cach of the Diagonal coding variations is tabulated in 


Tables A.f1 and A.f2 in Appendix A. 


D. CONCLUSIONS 

Diagonal coding ts not as effective as Huffman, Arithmetic, and lossless JPEG in 
compressing the residual image: however. Diagonal coding docs achieve higher 
compression of the residual image than does LZW in most cases. Diagonal coding 
achieves close to the same compression results as Huffman, Arithmetic, and lossless 
JPEG at some quality factors. As the quality factor used to compress the original image 
is increased. the compression achieved using Diagonal coding decreases. This 1s due to 
the residual image distribution widening, thereby resulting in longer diagonal codes. At 
some point, Diagonal coding will result in the expansion of the residual image file size. 
Diagonal coding resulted in an expansion of the residual image size when used to 
compress FINGERPRINT at a quality factor of 500 (see Figure V.7). The benefits of 
using Diagonal coding 1s its case of implementation and non-complex nature. It 1s a 
non-CPU intensive algorithm with minimal execution times as compared to Huffman and 
Arithmetic. Additionally, tt achieves comparable compression results at some quality 


factors. 
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Figure V.4: Direct Lossless Compression of Three Test Images. 
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Figure V.5: CR Achieved Using the Hybrid Lossless Compression 
Model on LENA at Various Quality Factors. 
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Figure V.6: CR Achieved Using the Hybrid Lossless Compression 
Model on SHUTTLE at Various Quality Factors. 
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Figure V.7: CR Achieved Using the Hybrid Lossless Compression 
Model on FINGERPRINT at Various Quality Factors. 
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Figure V.8: Comparison of Hybrid Lossless Compression Model 
with Standard Lossless Compression Methods for 
LENA at Various Quality Factors. 
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Figure V.9: Comparison of Hybrid Lossless Compression Model 
with Standard Lossless Compression Methods for 
Sia | Pee tev onious Olay actions: 
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Figure V.10: Comparison of Hybrid Lossless Compression Model 
with Standard Lossless Compression Methods for 
FINGERPRINT at Various Quality Factors. 
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VI. HYBRID MODEL OPTIMIZATION 


As discussed in section V.B, the quality factor will impact the Laplacian 
distribution of the residual image. The results of this chapter will show that the 
compressibility of both the browse and residual images depend on the quality factor. At 
low quality factors, minimal compression 1s achieved on the browse image; however, the 
residual image becomes highly compressible. is the qualitv factor is increased, the 
browse image 1s more compressible, but the residual image compresses less. Since the 
overall lossless image 1s the sum of the compressed browse and residual image data (sce 
Equation II.2), achieving maximum overall compression would ostensibly depend on 
finding some optimal quality factor. In this chapter, we will examine this issue as well as 
the sensitivity of the overall CR to the quality factor for the images chosen. 

Figures VI.1, V1.2, and VI.3 display the overall CR achieved using the hybrid 
lossless compression model with the three test images. These three figures are very 
similar to Figures V.5, V.6, and V.7. The difference is in the way the data is displayed 
and the number of quality factors used. The graphical data 1s tabulated in Tables A.13, 
A.14, and A.15 in Appendix A. Note that the graphical results of using Diagonal coding 
to compress FINGERPRINT in Figure V1.3 are limited to a quality factor of 350. This 1s 
due to the degree of expansion Diagonal coding produces at quality factors greater than 
350 on FINGERPRINT. The quality factor used to compress the original image ranges 
from a value of 5 to 1000 so that a wide range of browse and residual images are 
produced and evaluated. The three figures show that, for high quality factors, lossless 


JPEG 1s the compression method which achieves the best CR on the test images. [n most 
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instances, Huffman, Arithmetic, LZW, and Diagonal coding achieve decreasing 
compression on cach of the tmages at the higher quality factors while the hybrid lossless 
JPEG achieves virtually the same CR at quality factors of 50 or higher. This significs 
that at higher quality factors, the hybrid model ts relatively insensitive to the quality 
factor provided that lossless JPEG is used to compress the residual image. In other 
words, the correct choice for the quality factor 1s essentially dictated by conditions such 
as browse image compresston and browse image quality, not overall hybrid lossless CR. 
The browse image becomes visually distorted and lossy at the higher quality factors. It 1s 
left up to the user to determine what a good quality browse image is for the particular 
application the hybrid model is being used. 

At the lower quality factors, Huffman, Arithmetic, Diagonal, and lossless JPEG 
achteve comparable compression ratios at different quality factors for the three images. 
The choice of which lossless compression method to use depends on the user's 
requirements for complexity and compression/decompression time. LZW does not 
appear to be a wise choice for lossless compression tn almost any case. 

The advantage of decomposing the original image into a browse and residual 
image is the reduced compressed browse image file stze compared to the direct lossless 
compressed image file size. Figures V1.4, V1.5, and VI.7 display the browse and residual 
image compression ratios, and corresponding overall hybrid compression ratios, for the 
three test images at various quality factors. The lossless compression method (Huffman. 
Arithmetic, LZW, lossless JPEG, or Diagonal coding) that produces the highest overall 


hybrid CR 1s the one that ts graphed. The best direct lossless JPEG compression ratio ts 


> 


graphed for comparison. The graphical data ts tabulated tin Tables A.16, A.17, and A.18 
in Appendix A. As the quality factor increases, the browse CR decreases and the residual 
CR increases as expected. In all cases. the browse CR 1s significantly greater than the 
direct lossless CR. At quality factors of 100 or less, all three test images are visually 
lossless. .\ comparison of the browse CR with the direct lossless JPEG CR for quality 
factors of 100 or less (see Figures V1.4, VI.5, and VI.6) demonstrates the advantage of 
decomposing the original image into a browse and residual image (1.¢c., a visually lossless 
browse image 1s produced which has a significantly higher CR than the direct lossless 
IP EG 

The highest overall hybrid CR was achieved using Arithmetic coding at quality 
factors of 5, 50, and 100 for LENA in Figure VI.4. At quality factors of 500 and 800, 
lossless JPEG was used to compress the residual image. 

The highest overall hybrid CR was achieved using Arithmetic coding at quality 
factors of 5 and 50 for SHUTTLE in Figure V1.5. Lossless JPEG was used to compress 
the residual image at quality factors of 100, 500, and 800. 

The highest overall hybrid CR was achieved using Arithmetic coding at quality 
factors of 5, 50, and 100 for FINGERPRINT in Figure V1.6. At quality factors of 500 


and 800, lossless JPEG was used to compress the residual image. 


The results indicate that for low qualitv factors (<50) Arithmetic coding 1s the 


best choice for lossless compression of the residual images while at higher quality factors 


(>50), lossless JPEG 1s the best choice. 
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Figure VI.1: Hybrid Lossless Compression of LENA at Various 
Quality Factors. 
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Figure V1.2: Hybrid Lossless Compression of SHUTTLE at Various 
Quality Factors. 
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Figure VI.3: [1ybrid Lossless Compression of FINGERPRINT at 
Various Quality Factors. 
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Figure V1.4: Browse and Residual CR Comparison with Direct 
Lossless Compression for LENA. 
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Figure V1.5: Browse and Residual CR Comparison with Direct 
Lossless Compression for SHUTTLE. 
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Figure VI.6: Browse and Residual CR Comparison with Direct 
Lossless Compression for FINGERPRINT. 


Vil. CONCLUSIONS 


An analysis of the results of Chapters IV, V. and VI indicates that the proposed 
hvbrid lossless compression model has merit as a lossless image compression method. 
With the exception of lossless JPEG, the substitution of the other lossless compression 
methods (Huffman. Arithmetic, LZW. and Diagonal coding) into the hybrid model 
produce compression results that generally outperform their direct compression 
counterparts. The decomposition of the original image into browse and residual images 
gives an end-user the ability to browse an image and determine whether the residual 
image should be transmitted and added to the browse image to reproduce the original 
image. This feature is not available with any direct lossless compression method. The 
quality of the browse image and the overall compression achieved are determined by the 
quality factor used to compress the original image using lossy JPEG and ts a user 
controlled variable. The better the browse approximates the original data, the more 
compressible 1s the residual image data. Thus, a better quality browse results in a 
residual that can be compressed better in lossless mode. However, a better quality 
browse results in a larger browse image file size. The key factors are to select a quality 
factor which produces a visually acceptable browse image and a lossless compression 
method that achieve the best overall compression. 

The results show that LZW 1s not a lossless compression method which should be 
used to compress the residual image. The residual images do not contain long repetitive 
strings of pixel values which are necessary for LZW to achieve high compression results. 


This 1s not surprising since the LZW method ts designed primarily for compressing text, 
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not visual graphics [2, pp. 23-24]. Dragonal coding ts a viable candidate for lossless 
hybrid compression at lower quality factors. .\s the quality factor increases though, 
Diagonal coding results in poor compression and eventually even expansion of the 
residual image file size. Huffman and Arithmetic achieve comparable compression 
results at all quality factors. At the lower quality factors, Hluffman and Arithmetic do as 
well as or better than lossless JPEG in most cases: however, lossless JPEG ts the prime 
choice for lossless compression of the residual image at higher quality factors (1.e.. high 
c,,.). Under these conditions, the JPEG predictor ts better able to accurately predict pixel 
values for all residual image distributions resulting in higher compression ratios. This 
ostensibly ts a result of a higher 2-D correlation of pixel values within the corresponding 
residual images and, consequently, facilitates compression in the JPEG lossless method. 

Future areas of research include the classification of tmage types so that the 
optimum or nearly optimum combination of quality factor and lossless compression 
method may be selected which produces a visually acceptable browse image and the 
greatest overall compression ratio. Unfortunately, quality factors or rms error parameters 
are not perfect indicators of subjectively evaluated image quality. Until such an indicator 
exists, 1t appears that producing a general guideline for selecting a lossless compression 
method applicable to all images (in general) may not be possible. 

Another option is to evaluate the hybrid lossless compression model using fractal 
Image compression as the lossy compression algorithm. This has been pursued using a 


combination of lossy fractal compression and lossless LZW: however, other lossless 


6] 


compression methods were not reported to have been tested in their hybrid model [4]. 
For the data presented, tt appears that although the combination of lossy fractal and 
lossless LZW compression produced a lossless replica of the orrginal tmage, the overall 
CR achieved using the hybrid technique resulted in an expansion of the tmage file size. 
Compression was achieved only when the number of grev-scale values was Itmited (L.¢.. 


representing pixels using less than 8 bits). 
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APPENDIX A 
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Table A.I: Comparison of Quality Factor vs CR for the Three Test Images. 
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Table A.2: Comparison of Quality Factor vs e_. for the Three Test Images. 
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Table A.3: Best JPEG Algorithm (K) for Test Images at Various Quality Factors. 
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Compression Type LENA SHUTTLE FINGERPRINT 
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Table A.4: Comparison of Lossless Compression Methods on the Three Test Images. 
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Table A.5: CR Achieved Using Hybrid Lossless Compression Model on LENA at 
Various Quality Factors. 


Compression Q=50 Q=100 Q=500 
_—— 


Huffman | eos ees 5 tl SF 8 AS, oo | oo 7 


Table A.6: CR Achieved Using Hybrid Lossless Compression Model on SHUTTLE at 
Various Quality Factors. 
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Table A.7: CR Achieved Using Hybrid Lossless Compression Model on 
FINGERPRINT at Various Quality Factors. 









ef oe fw mar 
Type aa 
Huffman | Sa 

times [ina a0} a0 | srr 


Table A.8: Comparison of Hybrid Lossless Compression Model with Standard 
Lossless Compression Methods for LENA at Various Quality Factors. 
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Table A.9: Comparison of Hybrid Lossless Compression Model with Standard 
Lossless Compression Models for SHUTTLE at Various Quality Factors. 
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Table A.10: Comparison of Hybrid Lossless Compression Model with Standard 
Lossless Compression Methods for FINGERPRINT at Various Quality 
Factors. 













Table A.I1: CR Achieved Using Diagonal Coding and RLE Variation in Hybrid 
Model. 


Quality Factor LENA SHUTTLE FINGERPRINT 
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Table A.12: CR Achieved Using Diagonal Bit Plane Coding Variation in Hybrid 
Model. 
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Table A.13: Hybrid Lossless Compression of LENA at Various Quality Factors. 
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Table A.14: Hybrid Lossless Compression of SHUTTLE at Various Quality Factors. 
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Table A.15: Hybrid Lossless Compression of FINGERPRINT at Various Quality 
Factors. 
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Table A.16: Browse and Residual CR an for LENA. 
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Table A.17: Browse and Residual CR Comparison tor SHUTTLE. 
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Table A.18: Browse and Residual CR Comparison for FINGERPRINT. 
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APPENDIX B 


This appendix contains the source code for the Diagonal coding lossless 
compression method. The files Bitio.c, Bitio.h, Errhand.c., and Errhand.h are adapted 
from Nelson's text source code. The files Browse-c.c and Browse-e.c are written by the 
author. The programs are written in C and were compiled on a Sun Workstation with the 
GNU C compiler. The command to compile and Iink the programs 1s: 

gcc Browsc-c.c Brtto.c [irrhand.c -lm -o Browse-c 
This will result in a program called Browse-c which will encode an 8-bit 256x256 
grayscale image using Diagonal coding. Substitute Browse-c.c into the compile 
command to produce the program which will decode a Diagonal coded compressed 


image file. 
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Dec 10 05:40 1993) Srewse-—C eer ago 
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* browse-c.c: Program that performs Diagonal coding (encoding) on a 2o@e ee 


* To compile: g¢c browse-c.c Ditio. ec errhand.c —lm-Geercvs-—= : 
Os eae browse-c [input image file] [output image file] : 
=) AUCHOE: Doug Abbott ‘ 


HK KK RR KE Ke RRR E KKK KK RRR KR HK ROR RR RR 


#include <stdio.h> 
#include <stdlib.h> 
#include <math.h> 
FAneclude “Sit Tre. i. 
#include "“errhand.h" 


#define ROWS 256 
+#define COLS 216 


tifdef STDC __ 


vold Compressfrile( P1GE.“anpue eS tee Lr. Ole OME 

void ReadinputFile( FILE “inpuc,” Bil Pree *ousere ier ae 
int OutputCode (S17 FILE “outpur Trle, ier cia. 

void print. Yratwos( char “29 pul, -ebeieeur Urs, 

leng Eilers rzc« -chaias Manicws)), 


#else 


void CompressFile(); 
void ReadInputFile(); 
int OutputCode(); 
VOI0 DiI neat Looks, 
Hong Cube size); 


#endif 


main( argc, argv ) 


DAL sake, 
Ghat Gabon ie, 


{ 


Eee erie, 
Bit,h PLE cuteue, 


input = ftopen @argviltl. “so 
if ( input == NULL ) 
fatal error{ “Error opening 75 -rOrwIneur mn Jane: (alee 
output = OpenOutputBitFile( argv[2] ); 
1f ( output == NULL ) 
fatal error( "Error opening 25 f6r Outpur Wie oo wee 
Compressrile( anpuc, cutpuc )] 
CLiSGSecOutcputcBitrile (foutpue. 
folose( inpues); 
DEINC Fatios( argv[i arguilaiag 
BEL ULC) 


LZ 


mm OD:40 1993 browse-c.c Page 2 


id CompressFile( input, output ) 
m ~input; 
mc iLE “output; 


MenpuLrPile( input, output ); 


merneadinputFile( input, output _file ) 
= *~input; 
me iLLE *output file; 


= row; 
mcol; 
mc, VY; 


Read in the input file to be compressed. */ 


m( row = 0; row < ROWS; rowtt+ ) { 
mer ( col trem@ow. = COLS; -coltr ) { 
Seemoetc( input ); 
if ( c == EOF ) 
Pieeerror( “Error reading input grey scale file\n"” ); 
VeeeeuepurCode( output file, c ); 


{| “=e 


Purge the mask of any remaining bits. */ 


memeucmats( output file, 25/L, 1 ); 


Function which determines which range set the pixel value is located in 
and its location within the range set. The appropriate diagonal code 
and identification value are output. 


moutputCode( output_file, code ) 
ous “Output file; 
= code; 


Mivmcoo ot range, bottom of range; 
ite olt count, count; 


Bop Of range = 127, 
pottom of range = 128; 
meencount = 0; 

Bount = 0; 


Determine which range set the pixel value is located in. +*/ 


meme code > 127 ) { 
ties ( COde > top of range ) { 


aS 


Dec 10° U5S:40 1993 “orewse-C. Gerace. 


/* 


/* 


} 


7 


DEE mCOUnE tay, 
count t= 1; 
top of Tange = top cf Tange, 


} 
ae OC Cie 
while < code < boCtcm of eangem ra 
DEEwCount t+; 


COuncen=~ ky 
Dettom. Of Lange. = {Dect om Of Langer... 


Output diagonal code. */ 
OUTDUEBIES( OCUCDUL eile, liye bite ecounes). 


Determine the location of the pixel value within the range set and */ 
output the two identification bits. +7) 


Wee TNelsioley GS ee 


Lio Code == Perron orelanges, 
CutpucBiGs( outpuc file te) 
eitse 


QuCpucBits ( CutpUG Eile es eee 
} 
LE (eCOodeee a2 a). 41 
if ( code == top of range ) 
Suepuc bits (Output ti le eel ow 
else 
CUEDMIE BITES (rove pur eds le ee ey 
} 


reticn, -—© count); 


Determine the size of the input and output files (in bytes). */ 


#ifndef SEEK END 
#define SEEK END 2 
tendif 


Tong eile ts izes names) 
char *name; 


{ 


LOnG eo hate cis 
PILE fae: 


file = fopen( name, "r" ); 
ie (file == NUE ie) 


return(- 0.) 


fseck Gai le, 0 she hea, 
cOferec eee (es meilow ye, 
EGIO Se havkem.)s 

(oe Geri eo apc tele, 


} 
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Compute the compression ratio achieved. */ 


me Print ratios( input, output ) 
m “input; 
em *output, 


m input size; 
m output size; 


mratio; 

emeeece— file size( input ); 
Meimput Size == 0 ) 

enput size = l; 

Seemsize — tile size( output ); 
MmoubputL size == 0 ) 


Seeput size = |i; 
meme seor— (int) ( output size * 100L / input size ); 


mtf( “\nInput bytes: Sid i ampuE Size); 
Mmrtf( “Output bytes: sidin 7 OuLpUL size )5 
mee () Compression ratio: %d%%\n", ratio ); 


i> 
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[EEK KERR KEKE KREKREKEKEKEKREEKKEEKKEKKKERKKEEKKKEKKERKKEKEKR KER KA KK KKK eK AK Ae 


*  browse-e.c: 
* 

*~ To Compile: 
ms Dear wan 

< Ruchor: 


Program that performs decoding of a compressed image file 


that has been encoded with Diagonal coding. 

gcc browse-e.c bitio-.c errhand.c —-1m —On2ro (cc 
browse-e [input image file] [output image file] 
Doug Abbott 


* 


* 


* 


* 


CREE EREKREEE KEK EH RRE RHR EK KEE RK AR RK Ee Re 


#inelude <stdie.,h> 
#include <stdlib.h> 
#include <math.h> 
+oncmude.- bat tonne: 
#include "errhand.h" 


#define ROWS 
#define COLS 


#ifdef STDC _ 


Zoo 
Za0 


vold Expandrile( BIT BILE “1npur, 2 ies OuG pica» 

void Keadinputhile( Bil bir 2 inure ULE VOU eee wee, 
Int Inpueéede (Bit rls np it eielow 

llenG 2h Vews Ze (char = famems, 

vold print ~filesize( cha: “1npur clas our Soue 


#else 


void ExpandFile(); 
void ReadInputFile(); 


int. LipuEeoceqw)., 


long vite saize7, 
void print filesize(); 


Fongani 


main( ange, argv.) 


Pie wclaeie, 
ehatevaco wu 


{ 


FILE *output; 


BIT FILE *input; 


input = Openinputsickile( arewl lit»; 
1f ( input == NULL ) 

fatal error( “Error opening *%S for input m7 yang alee 
OUEDUE = fLopen{ argqv|(zZi, “wh; 


if o( CutpuE == 


NULL ) 


fatal erron( Bxeror Opening 2S ECs ouupuE momar en 
Expandbhile( iiipuwt, | cucouiE.). 
CloseInputBitFile( input ); 


Telose (output 


); 


Perper size (tial ta cig 12 ee 


return (.05)7; 


} 
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Me expandFile( input, output ) 
SeelLE *input; 
me ~OuUtput; 


minputFile( input, output ); 


Seeeecadinputrile( input, output file ) 
me ILE *input; 
m= “Output file; 


row; 
aol; 
igned char amplitude; 


LO? ie Oe 


Read in the input file to be decompressed. */ 


m( row = 0; row < ROWS; rowt+ ) 
Reeeemecol = 0; col < COLS; col++ ) { 
amplitude = InputCode( input ); 
peeereamplitude, output file ); 


Function which decodes the compressed image file. */ 


Selapuecode( input file ) 
othe anput file; 


mbit count; 

m result; 

signed char amp; 

® top of range; 
meoottom Of range; 
= count, 


mot range = 0; 
mt = 1; 


Get bits from input image file. */ 
Beemer — tint) InputBits( input file, 1 ); 
Megkamg ©£Or a bit ’1’. */ 
Bie ( bit count =-0) { 
eount++; 
Simeecemne — (int) InputBits( input file, 1 ); 
} 


Determine what range set the decoded code belongs to. */ 


eeemeeange = 130 + (( count - 1) * 2); 
Seemeoeerange = 12) — (( count - 1) * 2); 


iy 
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/* Get next two bits from the input data. These are the two identification */ 
[= aaa Ss. * / 


result-= (int) [npubBt ce (erie ena nee ee. 
/* Determine what the decoded pixel value is. */ 


LE (result sei) 

ani Cab OUmOr aranGe t= so) hese, 
else 

emipe= boltlom ol erandge ao eres le, 
PSuUrn (vamos); 


} 


/* Determine the size of the input and output files (in bytes). */ 


f#ifnder SEBRK END 
#define SEEK END 2 
erie sate 


long (fate si ze4e nance) 
char *name; 

{ 

long eof teell, 

FILE *file; 


file = fopen( name, "r" ); 
if ( file == NULL ) 

return( OL ); 
peeek (file, (Ui oh ihe NDS): 
Sone Tee Sete iI seal ee 
Fclose( file ); 
BeEUTH( COL etter), 


} 


VOUd /Dmant Veelesihe> (at NOME aCe Olina 
Char amp ur, 

Glee “OUEPUL, 

{ 

Long 2Anpueystze, 

TONG sOUrPuienss ze, 


Pit SZ ee seiebe gS ihZer aioli e 
Le (Wesbolonea mehr As R= 10) 
sbaheylic  isaleass sl 
QUCPUL Size — £itevei Ze @enLouLe) | 
Pe Cutotemst ae sa) 
OULPUE. S17 jae) 
Prinktl(s  \ninpuerovtes: sLa\N"; inpuieeisdkze ); 
DLIntt(  Cuteuts ovees. old a OulepUE Gs -2e ee 


} 
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DEK KKK KKK KKK KKK KK KKK KK & Start Oe BiTilo Cc eA ote ie A a, ok ke a aie ek ei ke, ie ik a eke 


This utility file contains all of the routines needed to implement 
bit oriented routines under either ANSI or KéR C. It needs to be 
Jinked with every program used in the entire book. 


f 


mlude <stdio.h> 
Mmlude <stdlib.h> 
miude "bitio.h” 
iclude "errhand.h" 


fine PACIFIER COUNT 2047 


me ilLe *OpenOutputBitFile( name ) 
ir *name; 


SaterriE *bit file; 


Pubeeeies — (BIT FILE *) calloc( 1, sizeof( BIT FILE ) ); 
meeodt: file == NULL ) 
meme ( bit file ); 
Sime tte->tile = fopen( name, "wb" ); 
Piel e>>rack 0; 
Bae tele—>mask 0x80; 
Pele -bacifier counter = 0; 
Getumm Obit file ); 


meiLE ~OpeniInputBitFile( name ) 
= “name; 


eit FILE onc, file; 


puleeeter— (bil FILE *) calloc( 1, Sizeof( BIT FILE ) ); 
Peecsott tile == NULL ) 
metmimen( O1it file ); 
ee ee te->fLile fopen( name, "rb" ); 
ee tile—>rack 0; 
Spent ile—-mask = 0x80; 
Piece =>pacifier counter = 0; 
Beeurm@ Olt file ); 


m CloseOutputBitFile( bit file ) 
Meine “bit file; 


Bemm@eet file—->mask != 0x80 ) 
Mima cOlEntile—>rack, bit £ile->fileé ) != bit file—>rack ) 
Seamer oGte aLalterror in CloseBitFile!\n" ),; 
Meme semoit file->file ); 


Beeewmeciiar *) bit file ); 


meerosecinputBitrile( bit file ) 
mr ILE *bit file; 


79 


Dec [0° 05217 1993 Ss biltie.6. Pages 


fclose( Dit_file—> imi), 
free ( (cha *) Dar aeelensy, 
} 


VO1d OULDUEBIE( DST ESE le oie.) 
Bit ei he beri re, 


cine ong 
{ 
Diet) 
Dit St vle=- rack «|= oi tree Mek, 
Date ee Mido eer, 
Lf ( bite tile—>mask=——= 0) 4 
if ( pute( bit. file->rack, bat file> tile eee eee 
fatal error( "Patal errougin, OuLpUuER IE ee 
else 
1£ ( ( bit £ile->pacifier counters? & PACIR Wan seeul i) -— eee) 


PULG( 4. } Ssrcour yy, 
batter lo—>racke— U, 
Dart 7 Pele mask seal, 


} 


void, OULDUEBIES( Dit filers code, Gouilers) 
Bitar OLE bie finle, 

uns _onedelong code, 

Ine veCoutte, 


{ 
unsigned long mask; 


mask —~ lle << wG@ncouniae——) )\- 
if ( code == 257 ) 
PUEG Dl DEP leo 7 rack gore fle] pieterg). 


else { 
while ( mask != Q) { 
if ( mask & code ) 
Dit fileq=>racks (> sbten tele mas, 
bit file->mask >>= 1; 
if ( bit _file->mask == 0 ) { 
if ( putc( bit _file->rack, bit vfile->file)  !— eee iile eee 
fatal, error() "Fatal error in OuteiEr ie ee = 
else if ( ( bit _file->pacifier countert, & PACTRIER COUN) | me 


DUCC(= 2 7= cedoul )); 
Dite.1 le--rack™ — 0, 
bit: blie=-masky—s0xo0,, 
} 


mask >>= 1; 
} 
1G Ent Bite (aise eee) 
Bee eee Da ea ey, 


{ 


int value; 
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fe ( bit file->mask == 0x80 ) { 
ampere —>rack = getc( bit file->file ); 
Mimmpe@eoit file->rack == EOF ) 
Boece ctrort. Fatal -error in inputBit!\n" °); 
Sepeeeree Fi le—>pacifier countert+t+ & PACIFIER COUNT ) == 0 ) 
pumet  .°, stdout ); 
} 
feaeew—sort file->rack & bit file->mask; 
bit file->mask >>= 1; 


omeere t1le—->mask == 0 ) 
fee ti le—->mask = 0x80; 
Perlrmt value ? 1: OQ ); 


meanea 1Ong InputBits( bit file, bit count ) 
meet *bit file; 
meoit count; 


unsigned long mask; 
Mgetgied long return value; 


Peete count == 7/5 ) { 

ieee rteetileé—>mask == 0x80 ) 
Omen erle—->-mask = 0x02; 

else 

Deore rr le—>mask == 0x40 ) 
pater te-—>mask = 0x01; 

else 

Peeorem:l le—>mask <= 0x20 } 
beet i le—->mask <<= 2; 


ime ott f£ile->mask == 0x80 ) 
mag — 1; 

} 

Pesteeeti << ( bit count — 1 ); 


cAereuri value = 0; 
while ( mask != 0) { 
Moeote ti le—>mask == 0x80 ) { 
Goemtetle—->rack = getc( bit file->file ); 


if ( bit file->rack == EOF ) 
Raitclwerson( “fatal Error in InputBit!\n" ); 
Mepeeonole f£ile—->pacifier countert+ & PACIFIER COUNT ) == 0 ) 


PUEei 6. 7 StLGOUL ); 


} 

Zemmepoie tlile—>rack & bit file->mask ) 
PeEwrn value |= mask; 

Maem >>= 1; 

bit file->mask >>= 1; 

miieott £ile->mask == 0 ) 
Pree tiltes-mask — 0x30; 

} 


Powdem return value ); 
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void FilePrintBinary (Ville = codes olccm: 


PGi ec, 
UNnSIOGNed. 1newcede. 
it Des; 

{ 


unsigned int mask; 


maske= le<< OC bites =. Vay, 
while ( mask != 0) { 
if ( code & mask ) 
LOU Cy 4k lew 
else 
Powe (Oe se eion 
mask >>= 1; 


} 


[O&K Re kee te eke ete ke oe eke ke oe eke ke te ke ake End One 


Biro 2G 
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mk kek kkk Kk Keke Kk Ke Ke KK Kk Kk KKK KEK Start Gite BITILO H 8 SEE ot Nella aaa Fe Re Dial ol Na alla ie oleae 


mdef BITIO H 
mine BITIO # 


iclude <stdio.h> 


Meer Struct bit file { 
PELE *~fLile; 
unsigned char mask; 
mnt rack; 
Miteedei tier counter; 


BT FILE; 
met STDC 
SF ILE Pooper InpucBitrile( char *name ); 
SFILE *OpenOutputBitFile( char *name ); 
d Raat: ( Sit PILE *bit file, int bit ),; 
‘al Siepucbits( BIT FILE *bit file, 
nniswoned longecode, “Int count 7); 
d StiekwOBItSs( BIT FELE *bit fale, 


HMcsEgied Ong “COde sine rCOUnE -); 
, Petit (SIT FILE *bit file ); 
Be -omremo INputBitsS{ BIT FILE *bit file, int bit count ); 


dd eee puLbIthrle( BIT FILE *bit file ); 
i MioscOucpDULBILPile( BIT FILE *bit file ); 
fl FPilePrintBinary( FILE *file, unsigned int code, int bits ); 


Semen oTDC */ 


BrILe *OpenInputBitFile(); 
er TLE POPeCHnOULPULCBItFile () ; 
d PiepuLcBit () ; 

d SiepucBits () ; 

d Clie TWOB1ES (); 


Pipes t (); 
mgned long InputBits(); 


d CloseInputBitFile(); 
a CloseOutputBitFile() ; 
d FilePrintBinary ({) ; 


mee CTDC */ 
mit /* BITIO 8 */ 


chew Keke keke keke kkk hh Kh Kh Khe Kh hk Kh Kh Kk KKK End ‘one BLLeLo H RA ERK RAR RE RR RK KK Xe / 


oo 


béc 10505730 2993 cririand seme agden. 


[ BRIER ERIK ARRAS Ker ee Start of ERRHAND cr KaKKKKKEKKEK KEK KKK KK KKK KEKE KKK 


x 


* This is a general purpose error handler used with every program in 
[ST EMe seOC kK. 
ey 


#include <stdio.h> 

#include <stdlib.h> 
#include <stdarg.h> 
tinclude.. e€rrhnand. i" 


7 LEIS Sth Cam 
vOld Tatal erreg( char “ct .aeerae 
#else 
tifdef UNIX__ 
VOI0 Fatal lerror (imu vaca ise, 
Chak. ~ iM: 
Va 0c! 
telse 
Zopue Baw herclibgishai song (a mele ® } 
char “ime; 
sendif 
tendif 
{ 
Vast ergpin, 


va _start( argptr, fmt ); 
PrIinté() “Patalwerrer er, 
Nigoranbqnessd( janihers cuaegeiee })s5 
Veen eave a., 
Oxi (ei. 

} 


[RRR KKK RK KK RRR RK RK KK KK RK Enda Que BRRHAND KKEKKEKK KER ERA EK © AK RX Oe ee 
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Kaew ake keke keke keke Kh Kha hehe Kk Start Ge ERRHAND H 2 eee RS ee AA Ee ae / 


fndef ERRHAND H 
2fine ERRHAND H 


Fdef STDC __ 
Meeecteserror( char “fmt, ... )j 
se fee tDC */ 

@ fatal error(); 

Bit /* stTpc */ 

dif /* ERRHAND H */ 


ewe ae kak ahh keke kkk kaa ke aKa End on ERRHAND H RRR A RN ARH AIR KK AK ee Ree A / 
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